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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: MEDIGEN SA - do LENZ and STAHEUN 

(B) STREET: 25 Grand Rue 

(C) CITY: GENEVA 

(E) COUNTRY: SWITZERLAND 

(F) POSTAL CODE (ZIP): CH 121 1 



(ii) TITLE OF INVENTION: METHODS FOR DIAGNOSIS AND THERAPY OF 
AUTOIMMUNE DISEASE, SUCH AS INSULIN DEPENDENT DIABETES 
MELLITUS, INVOLVING RETROVIRAL SUPERANTIGENS 



(iii) NUMBER OF SEQUENCES: 46 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(v) CURRENT APPLICATION DATA: 
APPLICATION NUMBER: PCT/EP 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA (synthetic) - 



(ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION:!. .25" 

(D) OTHER INFORMATION: /note- "page 11" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
TTTTTGAGTC CCCTTAGTAT TTATT 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "Synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION:!. .20" 

(D) OTHER INFORMATION :/note« " 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATC CAACAAC CATGATGGAG 

.20 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA" 
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(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1. .21 

(D) OTHER INFORMATION : /note» "page 26" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TCTCGTAAGG TGCAAATGAA G 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) . MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA W 



(ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION: 1. .21" 

(D) OTHER INFORMATION: /note= "PAGE 26" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTAAAGGATC AAGTGCTGTG C 



(2) INFORMATION FOR SEQ ID NO: 5; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CTTTACAAAG CAGTATTGCT GC 



22 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: . 

(A) NAME/KEY: misc feature 

(B) LOCATION; 1. .23 

(D) OTHER INFORMATION :/note= "RT la page 50' 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
YAAATGGMGW AYGYTAACAG ACT 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA" 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION : 1 . .23 

(D) OTHER INFORMATION: /note- "RT lb page 51 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
YAAATGGMGW AYGYTAACTG ACT 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH r 28 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



# 
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<ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION : 1 . . 2 8 

(D) OTHER INFORMATION: /note= "RT 2a-nested page 50" 



CGTCTAGAGC CTCTCCGGCA TGATCCCG 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME /KEY: misc_f eature 

(B) LOCATION : 1 , .28 

(D) OTHER INFORMATION : / note= "RT 2b-nested page 50" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CGTCTAGAGC CTCTCCGGCA TGATCCCA 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(ix) FEATURE: 

(A> NAME/ KEY t misc_f eature 

(B) LOCATION :1. .21 

(D) OTHER INFORMATION: /note- "Common 5' anchor page 50" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



TGCGCCAGCA ATGTAT CCAT G 



21 
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(2) INFORMATION FOR SEQ „ N0: Uj 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 21 base ~«f' 

(B) typp. _ , ease pairs 

(ii) MOLECULE TYPE- „i-k~ 

(A) MScSSioS^**/^ 61 * 16 aCid 

/desc = "SYNTHETIC DMA" 

(ix) FEATURE: 

(A) NAME/KEY: mi-,- * 

W LOCATION ,l!?£- f " tU ^ 

«» OTHER INFORMATION : /note= page M> 

SEQUENCE DESCRIPTION: SEQ I DN0: u . 
GGGTGGCAGT G CAT CAT AG G T 

(2) INFORMATION FOR SEQ „ NO: 12j 

(i) S fWENCE CHARACTERISTICS- 

(A) LENGTH • 70 k> 

(C) STRANDEDNeIs slnoJ 
fD) TOPOLOGY: lf ne « ^ 

Ui) MOLECULE TYPE- oth . 

(A) DESCRlSioN^ r /2 UCleiC acid 

<ix) FEATURE: 

(A) NAME/ KEY- m _ 

'B> LOCATlSil^H-^""" 

° THER ~-ON:/ note= , K1>2 . 4 

<X±) SE ° UENCE ^ RI PTI0N; SEQ ID N0: 12 . 
GGGAGAGGGT CAGCAGCAGA CA 

<2) INFORMATION FOR SEQ ID NO; 13; 

fi) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 22 hT. CS: 

W TOPOLOGY: "SiJi"* 1 * 
(ii) MOLECULE TYPE- n rh 

(A) OESCRlfTio" er /2 UCleiC acid 

/desc = "SYNTHETIC DNA" 



21 



22 



7/27 



(ix) FEATURE: 

(A) NAME/ KEY: mis cofeature 

(B) LOCATION : 1 . .22 

(D) OTHER INFORMATION: /note= B K1,2-10 page 51" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GACAGCAAGC CAGTGATAAG CA 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME /KEY: mis cofeature 

(B) LOCATION: 1. .19 

(D) OTHER INFORMATION :/note= W K1,2-16 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGAACAGGGA CTCTCTGCA 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY: mis cofeature 

(B) LOCATION:!. .20 

(D) OTHER INFORMATION :/note= "Kl,2-17 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:- 15: 
GGGAAGGGTA AGGAAGTGTG 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION : 1 . .19 

(D) OTHER INFORMATION : /note 3 "Kl,2-22 page 51" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGTGTTTCTC CTGAGGGAG 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE : 

(A) NAME/ KEY: misc feature 

(B) LOCATION: 1 - . 21 

(D) OTHER INFORMATION :/note= "Kl,2-2 6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: - 
GAAGAATGGC CAACAGAAGC T 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
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(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATION : 1 . .20 

(D) OTHER INFORMATION : /note 55 w Kl,2-27 page 51" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GGGAAACAAG GAGTGTGAGT 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DMA" 



(ix) FEATURE: 

(A) NAME/ KEY: mis cofeature 

(B) LOCATION: 1. . 39 

(D) OTHER^ INFORMATION : / note 8 w U3-R-poly (AS ) common page 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CATGTATAT G CGGCCGCTGC GCCAGCAATG TATCCATGG 39 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 
. (B) LOCATION : 1 .> „ 2 1 

(D) OTHER INFORMATION : /note a "RTl page 51" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



TATCTTTCGT TTCTGCAGCA C 



21 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY: mis cofeature 

(B) LOCATION: 1. .22 

(D) OTHER INFORMATION: /note= "RT2 page 51" 



(xi) SEQUENCE DESCRIPTION: . SEQ ID NO: 21: 
TAACTGGTTG AAGAGCTCGA CC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION : 1 . . 21 

(D) OTHER INFORMATION: /note= "R-U5-1 page 51" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
ATACTAAGGG GACTCAGAGG C 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
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(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION :1. .27 

(D) OTHER INFORMATION :/note= "R-U5-2 page 51" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CAGAGGCTGG TGGGATCCTC CATATGC 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC DNA" 



(ix) FEATURE: 

(A) NAME/KEY: raise feature 

(B) LOCATION : 1 . .25 

(D) OTHER INFORMATION :/note= "page 52 w 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TTTTTGAGTC CCCTTAGTAT TTATT 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/ KEY : misc feature 

(B) LOCATION :1.. 22~ 

(D) OTHER INFORMATION : / note 23 "Page 52" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



AGGTATTGTC CAAGGTTTCT CC 



22 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "SYNTHETIC DNA" 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION : 1 . .22 

(D) OTHER INFORMATION:/ no te= "page 52" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CTTTACAAAG CAGTATTGCT GC 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1. .21 

(D) OTHER INFORMATION :/note= "page 52" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GTAAAGGATC AAGTGCTGTG C 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) * LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
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(ix) FEATURE: 

(A) NAME/ KEY : mis cofeature 

(B) LOCATION : 1 . .29 

(D) OTHER INFORMATION: /note= "page 53" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GACTAAGCTT AAGAACC CAT CAGAGATGC 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: Vdesc = "synthetic DNA W 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1. .31 

(D) OTHER INFORMATION: /note- "page 53" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AGACTGGATC CGTTAAGTCG CTATCGACAG C 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "RETROVIRAL DNA" 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: I. .208 

(D) OTHER INFORMATION :/note= "FIGURE 7A" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CATCTCCCTC AGGAGAAACA CCCACGAATG AT CAAT AAAT ACTAAGGGGA CTCAGAGGCT 
GGTGGGATCC TCCATATGCT GAACGTTGGT TCCCGGGGCC CCCTTATTTC TTTCTCTATA 
CTTTGTCTCT • GTGTCTTTTT CTTTTCCAAG TCTTCTTCAT TTGCACCTTA CGAGAAACAT 
CTCCATCATG GTTGTTGGAT GGGGGCAA 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "RETROVIRAL DNA ,# 



(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: 1. . 1060 

(D) OTHER INFORMATION:/ not e= "FIGURE 7B M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CTGCAGGTGT ACCCAACAGC TCCGAAGAGA CAGTGACATC GAGAACGGGC CATGATGACG 
ATGGCGGTTT TGTCGAAAAG AAAAGGGGGA AATGT GGGGA AAAGCAAGAG AGATGAGATT 
GTTACTGTGT CTGTATAGAA AGAAGTAGAC ATAGGAGACT CCATTTTGTT CTGTACTAAG 
AAAAATTCTT CTGCCTTGAG AT GCT GTTAA TCTATGACCT TACCCCCAAC CCCGTGCTCT 
CTGAAACATG TGCCGTGTCA AACTCAGGGT TAAATGGATT AAGGGTGGTG CAAGATGTGC 
TTT GTTAAAC AGATGCTTGA AGGCAGCATG CTCATTAAGA GTCATCACCA CTCCCTAATC 
TCAAGTACCC AGGGACACAA ACACT GCGAA AGGCCGCAGG GACCTCTGCC TAGGAAAGCC 
AGGTATTGTC CAAGGTTTCT CCCCATGTGA TAGTCTGAAA TATGGCCTCG TGGGAAGGGA 
AAGACCT GAC CATCCCCCAG ACCAACACCC GTAAAGGGTC TGTGCTGAGG AG GATT AGT A 
TAAGAGGAAA GCATGCCTCT TGCAGTTGAG AGAAGAGGAA GACATCTGTC TCCTGCCCAT 
CCCCTGGGCA ATGGAATGTC TCAGTATAAA ACCCGATTGA ACATT C CAT C TACTGAGATA 
GGGAAAAACT GCCTTAGGGC TGGAGGTGGG ACATGTGGGC AGCAATACTG CTTTGTAAAG 
CATTGAGATG TTTATGTGTA TGTATATCTA AAAGCACAGC ACTTGATCCT TTACCTTGTC 
TAT GAT G CAA ACACCTTTGT TCACGTGTTT GTCTGCTGAC CCTCTCCCCA CTATTGTCTT 
GTGACCCTGA CACATCTCCC TCAGGAGAAA CACCCACGAA T GAT CAAT AA ATACTAAGGG 
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GACTCAGAGG CTGGTGGGAT CCTCCATATG CTGAACGTTG GTTCCCGGGG CCCCCTTATT 960 
TCTTTCTCTA TACTTTGTCT CTGTGTCTTT TTCTTTTCCA AGTCTTCTTQ. ATTTGCACCT 1020 
T AC GAGAAAC ATCTCCATCA TGGTTGTTGG ATGGGGGCAA 1060 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "RETROVIRAL, DNA M 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1. . 1754 

(D) OTHER INFORMATION: /not e= "FIGURE 7C" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ATGGTAACAC CAGTCACATG GATGGATAAT CCTATAGAAG TATAT GTTAA TGATAGTGTA . 60 

TGGGTACCTG GCCCCACAGA TGATCGCTGC CCTGCCAAAC CTGAGGAAGA AGGGATGATG 120 

ATAAATATTT CCATTGGGTA TCATTATCCT CCTATTTGCC TAGGGAGAGC AC CAGGATGT 180 

TTAATGCCTG CAGTCCAAAA TTGGTTGGTA GAAGTACCTA CTGTCAGTCC TAACAGTAGA 240 

TTCACTTATC ACAT GGTAAG CGGGATGTCA CTCAGGCCAC GGGTAAATTA TTTACAAGAC 300 

TTTTCTTATC AAAGAT CATT AAAATTTAGA CCTAAAGGGA AAACTTGCCC CAAGGAAATT 360 

CCTAAAGGAT CAAAGAATAC AGAAGTTTTA GTTTGGGAAG AATGTGTGGC CAATAGTGTG 420 

GTGATATTAC AAAACAATGA ATTCGGAACT ATTATAGATT AGGCACCTCG AGGTCAATTC 480 

TACCACAATT GCTCAGGACA AACTCAGTCG TGTCCAAGTG CACAAGTGAG TCCAGCTGTC 540 

GATAGCGACT TAACAGAAAG TCTAGACAAA CAT AAG CAT A AAAAATTACA GTCTTTCTAC 600 

CTTTGGGAAT GGGAAGAAAA AGGAATCTCT ACCCCAAGAC CAAAAATAAT AAGTCCTGTT 660 

TCTGGTCCTG AACATCCAGA ATTGTGGAGG CTTACTGTGG CCTCACACCA CATTAGAATT 720 

TGGTCTGGAA ATCAAACTTT AGAAACAAGA TATCGTAAGC CATTTTATAC TATCGACCTA 780 

AATTCCATTC TAACGGTTCC TTTACAAAGT TGCCTAAAGC CCCCTTATAT GCTAGTTGTA 840 

GGAAATATAG TTATTAAACC AGCCTCCCAA ACTATAACCT GTGAAAATTG TAGATTGTTT 900 

ACTTGCATTG ATTCAACTTT TAATTGGCAG CACCGTATTC TGCTGGTGAG AGCAAGAGAA 960 

GGCATGTGGA TCCCTGTGTC CACGGACCGA CCGTGGGAGG CCTCGCCATC CATCCATATT 1020 
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TTGACTGAAA TATTAAAAGG CGTTTTAAAT AGATCCAAAA GATTCATTTT TACTTTAATT 1080 

G CAGTGATTA TGGGATTAAT TGCAGTCACA GCTACGGCTG CTGTGGCAGG GGTTGCATTG 1140 

CACTCTTCTG TTCAGTCAGT AAACTTTGTT AATTATTGGC AAAAGAATTC TACAAGATTG *1200 

TGGAATTCAC AAT CTAGTAT TGATCAAAAA TTGGCAAGTC AAATTAATGA TCTTAGACAA 1260 

ACTGTCATTT GGATGGGAGA CAGGCTTGAC TTAGAACATC ATTTCCAGTT ACAGTGTGAC 1320 

TGGAATACGT CAGATTTTTG TATTACACCC CAAATTTATA ATGAGTCTGA GCATCACTGG 1380 

GACATGGTTA GACGCCATCT ACAGGGAAGA GAAGAT AAT C TCACTTTAGA CATTTCCAAA 14 40 

TTAAAAGAAC AAATTTTCGA AGCATCAAAA GCCCATTTAA ATTTGGTGCC AGGAACTGAG 1500 

GCAATTGCAG GAGTTGCTGA TGGCCTCGCA AAT CTTAAC C CTGTCACTTG GATTAAGACC 1560 

AT CAGAAGTA CTATGATTAT AAATCT CAT A TTAATCGTTG TGTGCCTGTT TTGTCTGTTG 1620 

TTAGTCTGCA GGTGTACCCC AACAGCTCCG AAAAAAACAG TGACATCGAG AACGGGCCAT 1680 

GAATGACAAA GGCGGTTTTT GTTCCAAAAA AAAAAGGGGG AAATTTTGGG GAAAACCAAA 1740 
AAAATGAAAA TGTT 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "RETROVIRAL DNA" 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION: 1. .520 

(D) OTHER INFORMATION : / note=» " FIGURE 7D" 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 59. .517 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
ACATTTGAAG TTCTACAATG AACCCATCAG AGAT GCAAAG AAAAGCGCCT CCACGGAG ' 58 

m T ^ ^ CCA GTC ACA TGG ATG ^ T CCT AT * GAA GTA TAT GTT 

Met Val Thr Pro Val Thr Trp Met Asp Asn Pro He Glu Val Tyr Val 

15 10 15 

AAT GAT AGT GTA TGG GTA CCT GGC CCC ACA GAT GAT CGC TGC CCT GCC 154 
Asn Asp Ser Val Trp Val Pro Gly Pro Thr Asp Aso Arg Cys Pro Ala 
20 25 * 30 



106 
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AAA CCT GAG GAA GAA GGG ATG ATG ATA AAT ATT TCC ATT GGG TAT CAT 202 
Lys Pro Glu Glu Glu Gly Met Met lie Asn He Ser He Gly Tyr His 
35 40 45 

TAT CCT CCT ATT TGC CTA GGG AGA GCA CCA GGA TGT TTA ATG CCT GCA " 250 
- Tyr Pro Pro lie Cys Leu Gly Arg Ala Pro Gly Cys Leu Met Pro Ala 
50 55 60 

GTC CAA AAT TGG TTG GTA GAA GTA CCT ACT GTC ACT CCT AAC AGT AGA 298 
Val Gin Asn Trp Leu Val Glu Val Pro Thr Val Ser Pro Asn Ser Ara 
« 70 75 80 

TTC ACT TAT CAC ATG GTA AGC GGG ATG TCA CTC AGG CCA CGG GTA AAT 346 
Phe Thr Tyr His Met Val Ser Gly Met Ser Leu Arg Pro Arg Val Asn 
85 90 95 

TAT TTA CAA GAC TTT TCT TAT CAA AGA TCA TTA AAA TTT AGA CCT AAA 394 
Tyr Leu Gin Asp Phe Ser Tyr Gin Arg Ser Leu Lys Phe Arg Pro Lys 
100 105 110 

GGG AAA ACT TGC CCC AAG GAA ATT CCT AAA GGA TCA AAG AAT ACA GAA 442 
Gly Lys Thr Cys Pro Lys Glu He Pro Lys Gly Ser Lys Asn Thr Glu 
115 120 12 5 

GTT TTA GTT TGG GAA GAA TGT GTG GCC AAT AGT GTG GTG ATA TTA CAA 490 
Val Leu Val Trp Glu Glu Cys Val Ala Asn Ser Val Val He Leu Gin 
130 135 140 



AAC AAT GAA TTC GGA ACT ATT ATA GAT TAG 
Asn Asn Glu Phe Gly Thr He He Asp 
145 150 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Val Thr Pro Val Thr Trp Met Asp Asn Pro He Glu Val Tyr Val 
1 5 io is 

Asn Asp Ser Val Trp Val Pro Gly Pro Thr Asp Asp Arg Cys Pro Ala 
20 25 30 

Lys Pro Glu Glu Glu Gly Met Met He Asn He Ser He Gly Tyr His 
35 40 . 45 

Tyr Pro Pro He Cys Leu Gly Arg Ala Pro Gly Cys Leu Met Pro Ala 
50 55 60 

Val Gin Asn Trp Leu Val Glu Val Pro Thr Val Ser Pro Asn Ser Arg 
■ 65 . 70 75 80 

Phe Thr Tyr His Met Val Ser Gly Met Ser Leu Arg Pro Arg Val Asn 
85 90 95 



520 
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Tyr Leu Gin Asp Phe Ser Tyr Gin Arg Ser Leu Lys Phe Arg Pro Lys 
. 100 105 HO 

Gly Lys Thr Cys Pro Lys Glu lie Pro Lys Gly Ser Lys Asn Thr Glu 
H* 120 125 

Val Leu- Val Trp Glu Glu Cys Val Ala Asn Ser Val Val He Leu Gin 
130- 135 " 140 

Asn Asn Glu Phe Gly Thr He He Asp 
145 150 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 603 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "retroviral DNA" 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: 1. . 603 

(D) OTHER INFORMATION: /note= H FIGURE 7E W 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ACATTTGAAG TTCTACAATG AACCCATCAG AGATGCAAAG AAAAGCGCCT CCACGGAGAT 60 

GGTAACACCA GTCACATGGA TGGATAATCC TATAGAAGTA TAT GTTAATG ATAGTGTATG 120 

GGTACCTGGC CCCACAGATG ATCGCTGCCC TGCCAAACCT GAGGAAGAAG G GAT GAT GAT 180 

AAATATTTCC ATTGGGTATC ATTATCCTCC TATTTGCCTA GGGAGAGCAC CAGGATGTTT 24 0 

AATGCCTGCA GTCCAAAATT GGTTGGTAGA AGTACCTACT GTCAGTCCTA ACAGTAGATT 300 

CACTTATCAC ATGGTAAGCG GGATGTCACT CAGGCCACGG GTAAATTATT TACAAGACTT 360 

TTCTTATCAA AGAT CATTAA AATTTAGACC TAAAGGGAAA ACTTGCCCCA AGGAAATTCC 420 

TAAAGGATCA AAGAATACAG AAGTTTTAGT TTGGGAAGAA TGTGTGGCCA ATAGT GTGGT 4 80 

GATATTACAA AACAATGAAT TCGGAACTAT TATAGATTAG GCACCTCGAG GTCAATTCTA 54 0 

CCACAATTGC TCAGGACAAA CTCAGTCGTG TCGAAGTGCA CAAGTGAGT C CAGCTGTCGA 600 
TAG 

603 
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) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/ KEY: Protein 

(B) . LOCATION: 1. .561 

(D) OTHER INFORMATION: /note= "Figure 7F W 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Val Thr Pro Val Thr Trp Met Asp Asn Pro lie Glu Val Tyr Val 
1 5 10 15 

Asn Asp Ser Val Trp Val Pro Gly Pro Thr Asp Asp Arg Cys Pro Ala 
20 .25 30 

Lys Pro Glu Glu Glu Gly Met Met He Asn lie Ser He Gly Tvr His 
35 40 45 

Tyr Pro Pro He Cys Leu Gly Arg Ala Pro Gly Cys Leu Met Pro Ala 
50 55 60 

Val Gin Asn Trp Leu Val Glu Val Pro. Thr Val Ser Pro Asn Ser Arq 
65 70 75 80 

Phe Thr Tyr His Met Val Ser Gly Met Ser Leu Arg Pro Arg Val Asn 
85 90 95 

Tyr Leu Gin Asp Phe Ser Tyr Gin Arg Ser Leu Lys Phe Arg Pro Lys 
100 105 no 

Gly Lys Thr Cys Pro Lys Glu He Pro Lys Gly Ser -Lys Asn Thr Glu 
115 120 125 

Val Leu Val Trp Glu Glu Cys Val Ala Asn Ser Val Val He Leu Gin 
130 135 i 4 o 

Asn Asn Glu Phe Gly Thr He He Asp Glx Ala Pro Arg Gly Gin Phe 
145 150 155 160 

Tyr His Asn Cys Ser Gly Gin Thr Gin Ser Cys Pro Ser Ala Gin Val 
i65 170 175 

Ser Pro Ala Val Asp Ser Asp Leu Thr Glu Ser Leu Asp Lys His Lys 
180 185 • 190 

His Lys Lys Leu Gin Ser Phe Tyr Leu Trp Glu Trp Glu Glu Lys Gly 
I 95 200 205 

He Ser Thr Pro Arg Pro Lys He He Ser Pro Val Ser Gly Pro Glu 
21° 215 220 

His Pro Glu Leu Trp Arg Leu Thr Val Ala Ser His His He Arg He 
225 230 235 240 
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Trp Ser Gly Asn Gin Thr Leu Glu Thr Arg Tyr Arg Lys Pro Phe Tyr 
245 250 255 

Thr He Asp Leu Asn Ser He Leu Thr Val Pro Leu Gin Ser Cys Leu 
260 265 270 

Lys Pro Pro Tyr Met Leu Val Val Gly Asn He Val He Lys Pro Ala 
275 280 285 

Ser Gin Thr He Thr Cys Glu Asn Cys Arg Leu Phe Thr Cys He Asp 
290 295 300 . 

Ser Thr Phe Asn Trp Gin His Arg He Leu Leu Val Arg Ala Arg Glu 

305 310 315 

Gly Met Trp He Pro Val Ser Thr Asp Arg Pro Trp Glu Ala Ser Pro 
325 330 335 

Ser He His He Leu Thr Glu He Leu Lys Gly Val Leu Asn Arg Ser 
340 345 350 

Lys Arg Phe He Phe Thr Leu He Ala Val He Met Gly Leu He Ala 
355 360 365 

Val Thr Ala Thr Ala Ala Val Ala Gly Val Ala Leu His Ser Ser Val 
370 375 380 

Gin Ser Val Asn Phe Val Asn Tyr Trp Gin Lys Asn Ser Thr Arg Leu 
385 390 395 4Q0 

Trp Asn Ser Gin Ser Ser He Asp Gin Lys Leu Ala Ser Gin lie Asn 
405 410 415 

Asp Leu Arg Gin Thr Val He Trp Met Gly Asp Arg Leu Asp Leu Glu 
42 0 425 430 



His His Phe Gin Leu Gin Cys Asp Trp Asn Thr Ser Asp Phe Cys He 
435 440 445 

Thr Pro Gin He Tyr Asn Glu Ser Glu His His Trp Asp Met Val Arg 
450 455 460 

Arg His Leu Gin Gly Arg Glu Asp Asn Leu Thr Leu Asp He Ser Lys 
465 47 ° 475 480 

Leu Lys Glu Gin lie Phe Glu Ala Ser Lys Ala His Leu Asn Leu Val 
485 4 90 49S 

Pro Gly Thr Glu Ala He Ala Gly Val Ala Asp Gly Leu Ala Asn Leu 
500 505 510 

Asn Pro Val Thr Trp He Lys Thr He Arg Ser Thr Met He He Asn 
51 5 520 525 

Leu lie Leu He Val Val Cys Leu Phe Cys Leu Leu Leu Val Cys Arg 
3JU 535 540 



Cys Thr Pro Thr Ala Pro Lys Lys Thr Val Thr Ser Arg Thr Gly His 
Glu 



545 550 555 - 560 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/ KEY: inisc_f eature 

(B) LOCATION: 1. .604 

(D) OTHER INFORMATION:/ not e= "FIGURE 7G" 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 59. .601 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

ACATTT GAAG TTCTACAATG AACCCATCAG AGATGCAAAG AAAAGCGCCT CCACGGAG 

ATG GTA ACA CCA GTC ACA TGG ATG GAT AAT CCT ATA GAA GTA TAT GTT 
Met Val Thr Pro Val Thr Trp Met Asp Asn Pro He Glu Val Tyr Val 
15 10 15 

AAT GAT AGT GTA TGG GTA CCT GGC CCC ACA GAT GAT CGC TGC CCT GCC 
Asn Asp Ser Val Trp Val Pro Gly Pro Thr Asp Asp Arg Cys Pro Ala 
20 25 30 

AAA CCT GAG GAA GAA GGG ATG ATG ATA AAT ATT TCC ATT GGG TAT CAT 
Lys Pro Glu Glu Glu Gly Met Met He Asn He Ser He Gly Tyr His 
35 40 45 

TAT CCT CCT ATT TGC. CTA GGG AGA GCA CCA GGA TGT TTA ATG CCT GCA 
Tyr Pro Pro He Cys Leu Gly Arg Ala Pro Gly Cys Leu Met Pro Ala 
50 55 60 

vl? ^ t** l GG 7 TG GTA GTA CCT ACT GTC AGT CCT AAC AGT AGA 

Val Gin Asn Trp Leu Val Glu Val Pro Thr Val Ser Pro Asn Ser Arg 

65 7 ° 75 80 

II C A 5 T ™ T ^ ATG GTA AGC GGG ATG T( =A CTC AGG CCA CGG GTA AAT 
Phe Thr Tyr His Met Val 3er Gly Met Ser Leu Arg Pro Arg Val Asn 
85 .90 95 

Ivl Hi <^ TTT TCT TAT CAA AGA TCA TTA AAA TTT AGA CCT AAA 

Tyr Leu Gin Asp Phe Ser Tyr Gin Arg Ser Leu Lys Phe Arg Pro Lys 
100 105 no 

GGG AAA ACT TGC CCC AAG GAA ATT CCT AAA GGA TCA AAG AAT ACA GAA 
Gly Lys Thr Cys Pro Lys Glu He Pro Lys Gly Ser £n i£ 2J 

120 125 

GTT TTA GTT TGG GAA GAA TGT GTG GCC AAT AGT GTG GTG ATA TTA CAA 
Val Leu Val Trp Glu Glu Cys Val Ala Asn Ser Val Val lie lit S£ 
±JU 1 35 140 



58 
106 

154 

202 

250 

298 

346 

394 

442 

490 
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AAC AAT GAA TTC GGA ACT ATT ATA GAT TTA GGC ACC TCG AGG TCA ATT 538 
Asn Asn Glu Phe Gly Thr lie lie Asp Leu Gly Thr Ser Arg Ser lie 
145 150 155 160 



CTA CCA CAA TTG CTC AGG ACA AAC TCA GTC GTG TCC AAG TGC ACA AGT 586 
Leu Pro Gin Leu Leu Arg Thr Asn Ser Val Val Ser Lys Cys Thr Ser 
165 170 175 



GAG TCC AGC TGT CGA TAG 
Glu Ser Ser Cys Arg 
180 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 181 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Val Thr Pro Val Thr Trp Met Asp Asn Pro He Glu Val Tyr Val 
1 5 10 15 

Asn Asp Ser Val Trp Val Pro Gly Pro Thr Asp Asp Arg Cys Pro Ala 
20 25 30 

Lys Pro Glu Glu Glu Gly Met Met He Asn He Ser He Gly Tyr His 
35- 40 45 

Tyr Pro Pro lie Cys Leu Gly Arg Ala Pro Gly Cys Leu Met Pro Ala 
50 55 eo 

Val Gin Asn Trp Leu Val Glu Val Pro Thr Val Ser Pro Asn Ser Aro 
65 70 75 80 

Phe Thr Tyr His Met Val Ser Gly Met Ser Leu .Arg Pro Arg Val Asn 
85 90 95 

Tyr Leu Gin Asp Phe Ser Tyr Gin Arg Ser Leu Lys Phe Arg Pro Lys 
100 105 no 

Gly Lys Thr Cys Pro Lys Glu He Pro Lys Gly Ser Lys Asn Thr Glu 
115 120 125 

Val Leu Val Trp Glu Glu Cys Val Ala Asn Ser Val Val He Leu Gin 
130 135 140 

Asn Asn Glu Phe Gly Thr He He Asp Leu Gly Thr Ser Arg Ser He 
145 150 155 160 

Leu Pro Gin Leu Leu Arg Thr Asn Ser Val Val Ser Lys Cys Thr Ser 
165 170 175 

Glu Ser Ser Cys Arg 
180 



604 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

<C) ST HANDEDNESS : unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/ KEY: Protein 

(B) LOCATION : 1 . .182 

(D) OTHER INFORMATION :/note= "FIGURE 7H" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



Phe Thr lie Pro Leu Ala Glu Gin Asp Cys Glu Lys Phe Ala Phe Thr 
15 io is 

lie Pro Ala He Asn Asn Lys Glu Pro Ala Thr Arg Phe Gin Trp Lys 
20 25 30 

Val Leu Pro Gin Gly Met Leu Asn Ser Pro Thr He Cys Gin Thr Phe 
35 40 45 

Val Gly Arg Ala Leu Gin Pro Val Arg Asp Lys Phe Ser Asp Cys Tyr 
50 55 60 

He He His Tyr Phe Asp Asp He Leu Cys Ala Ala Glu Thr Lys Asp 
« 70 75 80 

Lys Leu He Asp Cys Tyr Thr Phe Leu- Pro Ala Glu Val Ala Asn Ala 
85 90 95 

Gly Leu Ala He Ala Ser Asp Lys He Gin Thr Ser Thr Pro Phe His 
100 105 no 

Tyr Leu Gly Met Gin He Glu Asn Arg Lys He Lys Pro Gin Lys He 
115 120 125 

Glu He Arg Lys Asp Thr Leu Lys Thr Leu Asn Asp Phe Gin Lys Leu 
130 135 140 

Leu Gly Asp He Asn Trp He Arg Pro Thr Leu Gly He Pro Thr Tvr 
145 150 155 . 160 

Ala Met Ser Asn Leu Phe Ser He Leu Arg Gly Asp Ser Asp Leu Asn 
165 170 175 

Ser Lys Arg Met Leu Thr 
180 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 base pairs 
(BJ TYPE: nucleic acid 
(C) STRANDEDNESS: single 



* * 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "retroviral DMA" 



(ix) FEATURE : 

(A) NAME/ KEY: mis cofeature 

(B) LOCATION: 1. .250 

(D) OTHER INFORMATION :/note= "FIGURE 8A" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 0: 
GTAAAT GACA CCTATGATGC ACTGCCACCC TTTCACTGTT TCACCCTGAA CATCTGCTTT 
TTACATCTAA GTGATTGTAC CCAATAAATA GTGTGGAGAC CAGAGCTCTG AGC C TTT T GC 
AGCCTCCATT TTGCAACTGG TCCCCTGGCT CCCACCTTTA TGAACTCTTA ACCTGT CT T T 
TCTCATTCCT TTGTCACCAT TGGACTTTGG GTACCCTACG GGTGGTGTTG AGGCTGTCAC 
CGCACATTAA 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 
.(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "RETROVIRAL DNA** 



(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: 1. .203 

(D) OTHER INFORMATION: /not e= "FIGURE 8B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GTTTAGTTAA TCTATAATCT ATAGAGACAA TGCTTATCAC TGGCTTGCTG TCAATAAATA 
TGTGGGTAAA TCTCTGTTCA AGACTCT CAG CTTTGAAGCT GTGAGACCCC TGATTTCCCA 
CTCCACACCT CTATATTTCT GTGTGTGTGT CTTTAATTCC TCCAGTGTTG CTGGGTTAGG 
GTCTCCTCGA CGAGCTGTCG TGC 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS z single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "RETROVIRAL DMA** 

(ix) FEATURE: 

. (A) NAME/ KEY: misc_f eature 
<B) LOCATION: 1. .283 

(D) OTHER INFORMATION :/note= "FIGURE 8C" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
AACTCAGCTG CTGCACAGTG GTCGAGCCTC CAGAGCT CAT GCCATT GCAG TGGTCAGAGC 
CTGGCCCTCC TCTTCCTGCA TAGAACCTGG ATTCAATCTG TAAGGTGGGA AGTGCAGCAG 
CAGAGAACTC TGGCCTTGCA GAGAGTCCCT GTTCCCACTT CACTTTCCTT TTCACCAAAT 
AAAACCCTGC TTTCACTCAT GCATCAAATT GTCTGTGAGC CTACATTTTT GTGGCCATGG 
GACAAGAACA CCATCTTTAG CTGAGCTAGG GAAAAGTCCT GCA 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "retroviral DNA" 

(ix) FEATURE: 

(A) NAME/ KEY: mis cofeature 

(B) LOCATION: 1. .245 

(D) OTHER INFORMATION : /notes "FIGURE 8D" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GATGTGACCA CTGTGACCTA CCTACACTGG AGATGGCTCA CACTTCCTTA CCCTTCCCCT 
GCTGTACCAA TAAATAACAG CACAGCCTGA CATTCGGAGC CATTACCGGT CTTTGTGACT 
TGGTGGTAGT GGTATCCCCT AGGGCCCAGC TGTCTTTTCT TTTATCTCTT TGTCTTGTGT 
CTTTATTT CT ATGAGTCTCT CGTCTCCGCA CATGGGGAGA AAAACCCATA GACCCTGTAG 
GGCTG 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 base pairs 

(B) TYPEr nucleic acid 
<C) STRANDEDNESS: single 



60 
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(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "retroviral DNA" 

(ix) TEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION: 1. .181 

(D) OTHER INFORMATION: /note= "FIGURE 8E" 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
CTCACAAAAA TAATAAAAGC TTCTGTTGGC CATTCTTCAG ATCTTCATCT CTTGTGAGGA 
TCCCCCTGTA CATGTAAAAA TGTAATAAAA CTTGTATCCT , TTCTCCTCTT AATCTGTCTT 
GCATCAATAT CATTC CTAGA CCCAGTCAGA GATGGGTGGA GGTGAGCCGT ACATTTCCCT 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "RETROVIRAL DNA" 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1. .287 

(D) OTHER INFORMATION : /note° "FIGURE 8F" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
CAGAGAACT C CAGCCAGCTG TGATGGAGCC TCAGGAAGTT CACAGTTGCA GCAGGAAGGA 
GCCTGGCTGC TCCTCTTCCT GTGTGGAACC T GGGATTAGA ACAGGCTGGC AGGAAGTGCT 
TTAGCAGGGA CTCTGGCCTA CTCACACTCC TTGTTTCCCC CCTTTCTTCC TTTTCACTCA 
ATAAAGCCCT GTCTTACTCA CCATTCAAAT TGTCTGTGAG CCTGAATTTT CATGGCTGTG 
GGACAAAGAA CCCTATTTTT AGCT GAACTA AGGAAAATTC CTGCAAA 



(2) INFORMATION FOR SEQ ID NO; 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 
. (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "RETROVIRAL DNA" 

(ix) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATION: 1. .264 

(D) OTHER INFORMATION : /note*" "FIGURE 8G" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GTGATTGTCT GCTGACCCTC TCCCCACAAT TGTCTTGTGA CCCTGACACA TCCCCCTCTT 
CGAGAAACAC CCGCGGATGA TCAATAAATA TTAAGGGAAC TCAGAGGCTG GCAGGATCCT 
CCATATGCTG AACGCTGGTT GCCCCGGGTC CCCTTCTTTC TTTCTCTATA CTTTGTCTCT 
GTGTCTTTTT CTTTTCCAAA TCTCTCGTCC CACCTTACGA GAAACACCCA CAGGTGTGTC 240 
CGGGCAACCC AACGCCACAT AACA 

264 



60 

120 

180 



